An Environment for Word Prominence Classification in Slovenian Language

نویسندگان

  • Slovenian Lang
  • Janez STERGAR
چکیده

Besides phrasing, prominence is one of the most important parameters of speech prosody to model. The so called data driven approaches nowadays seem to be the appropriate solution for prosody modeling in current text to speech (TTS) systems. They allow prosodic regularities to be automatically extracted from a prosodic database of natural speech. In this paper we’ll present an evaluation of suitability for automatic word prominence classification of Slovenian language with a hierarchical approach. We classified the prominence of words into two groups, characterized by pitch movements (pitch accent) and stress (stress accent). Pitch movements have been detected from the interpolated syllable pitch contour, while syllable stress was classified from the quantity of energy in the high band of vowel spectra. We’ll also present an examination of the correlation between the hand labeled prominent words and the extracted prosody features of the mentioned two classes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents

Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...

متن کامل

Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents

Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...

متن کامل

Jezikovno neodvisno modeliranje pregibnega jezika

This article concerns statistical language modelling of Slovenian language for automatic speech recognition. We investigate various techniques for overcoming the difficulties in modelling highly inflected languages. Slavic languages are particularly challenging languages and Slovenian language is one of them. Two main problems arise when modelling Slovenian language in comparison to English. Th...

متن کامل

A First Glimpse of Kanakanavu Word Prominence

This study investigated the word prominence pattern of Kanakanavu, a critically endangered Austronesian language spoken in Taiwan. Previous studies on the phonetic correlates of Piwan and Saisiyat agreed that pitch is the only consistent cue, indicating that Formosan languages are more like pitchaccent languages. However, given that word accents are in a fixed position for those two languages, ...

متن کامل

SI-PRON Pronunciation Lexicon: a New Language Resource for Slovenian

We present the efforts involved in designing SI-PRON, a comprehensive machine-readable pronunciation lexicon for Slovenian. It has been built from two sources and contains all the lemmas from the Dictionary of Standard Slovenian (SSKJ), the most frequent inflected word forms found in contemporary Slovenian texts, and a first pass of inflected word forms derived from SSKJ lemmas. The lexicon fil...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003